Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix rendering dbt tests with multiple parents #1433

Merged
merged 8 commits into from
Dec 30, 2024
Merged

Conversation

tatiana
Copy link
Collaborator

@tatiana tatiana commented Dec 27, 2024

If these two circumstances are met:

  1. The dbt project has tests that rely on multiple parent models and;
  2. The DbtDag or DbtTaskGroup use TestBehavior.AFTER_EACH (default) or TestBehavior.BUILD

Cosmos 1.8.0 and previous versions would attempt to run the same test multiple times after each parent model run, likely failing if any of the parents hadn't been run yet.

This PR aims to fix this behaviour by not running tests with multiple dependencies within each task group / build task - and by adding those tests to run only once and after all parents have run.

Related issues

Closes: #978
Closes: #1365

This change also sets the ground for adding support to tests that don't have any dependencies, a problem discussed in the following tickets:

How to reproduce

There are two steps to reproduce this problem:

  1. To create a representative dbt project
  2. To create a Cosmos DbtDag that uses this dbt project to reproduce the original problem

Representative dbt project

We created a dbt project named multiple_parents_test that has a test calledcustom_test_combined_model that depends on two models:

  • combined_model
  • model_a

The expectation from a user perspective is that, since the combined_model depends on model_a, that the multiple_parents_test will only be run after both models were run, once.

Definitions of the test:

{% test custom_test_combined_model(model) %}
WITH source_data AS (
    SELECT id FROM {{ ref('model_a') }}
),
combined_data AS (
    SELECT id FROM {{ model }}
)
SELECT
    s.id
FROM
    source_data s
LEFT JOIN
    combined_data c
    ON s.id = c.id
WHERE
    c.id IS NULL
{% endtest %}

By running the following dbt build command, we confirm that the test depends on both models:

dbt build --select "+custom_test_combined_model_combined_model_"
11:59:29  Running with dbt=1.8.2
11:59:29  Registered adapter: postgres=1.8.1
11:59:29  Found 3 models, 6 data tests, 414 macros
11:59:29  
11:59:30  Concurrency: 4 threads (target='dev')
11:59:30  
11:59:30  1 of 9 START sql view model public.model_a ..................................... [RUN]
11:59:30  2 of 9 START sql view model public.model_b ..................................... [RUN]
11:59:30  1 of 9 OK created sql view model public.model_a ................................ [CREATE VIEW in 0.18s]
11:59:30  2 of 9 OK created sql view model public.model_b ................................ [CREATE VIEW in 0.18s]
11:59:30  3 of 9 START test unique_model_a_id ............................................ [RUN]
11:59:30  4 of 9 START test unique_model_b_id ............................................ [RUN]
11:59:30  4 of 9 PASS unique_model_b_id .................................................. [PASS in 0.05s]
11:59:30  3 of 9 PASS unique_model_a_id .................................................. [PASS in 0.06s]
11:59:30  5 of 9 START sql view model public.combined_model .............................. [RUN]
11:59:30  5 of 9 OK created sql view model public.combined_model ......................... [CREATE VIEW in 0.03s]
11:59:30  6 of 9 START test custom_test_combined_model_combined_model_ ................... [RUN]
11:59:30  7 of 9 START test not_null_combined_model_created_at ........................... [RUN]
11:59:30  8 of 9 START test not_null_combined_model_id ................................... [RUN]
11:59:30  9 of 9 START test not_null_combined_model_name ................................. [RUN]
11:59:30  7 of 9 PASS not_null_combined_model_created_at ................................. [PASS in 0.07s]
11:59:30  9 of 9 PASS not_null_combined_model_name ....................................... [PASS in 0.07s]
11:59:30  8 of 9 PASS not_null_combined_model_id ......................................... [PASS in 0.07s]
11:59:30  6 of 9 PASS custom_test_combined_model_combined_model_ ......................... [PASS in 0.08s]
11:59:30  
11:59:30  Finished running 3 view models, 6 data tests in 0 hours 0 minutes and 0.50 seconds (0.50s).
11:59:30  
11:59:30  Completed successfully
11:59:30  
11:59:30  Done. PASS=9 WARN=0 ERROR=0 SKIP=0 TOTAL=9

This is what the pipeline topology looks like:

Screenshot 2024-12-27 at 11 39 31

The source code structure for this dbt project:

├── dbt_project.yml
├── macros
│   └── custom_test_combined_model.sql
├── models
│   ├── combined_model.sql
│   ├── model_a.sql
│   ├── model_b.sql
│   └── schema.yml
└── profiles.yml

When running dbt ls, it displays:

dbt ls
11:40:58  Running with dbt=1.8.2
11:40:58  Registered adapter: postgres=1.8.1
11:40:58  Unable to do partial parsing because saved manifest not found. Starting full parse.
11:40:59  [WARNING]: Deprecated functionality
The `tests` config has been renamed to `data_tests`. Please see
https://docs.getdbt.com/docs/build/data-tests#new-data_tests-syntax for more
information.
11:40:59  Found 3 models, 6 data tests, 414 macros
my_dbt_project.combined_model
my_dbt_project.model_a
my_dbt_project.model_b
my_dbt_project.custom_test_combined_model_combined_model_
my_dbt_project.not_null_combined_model_created_at
my_dbt_project.not_null_combined_model_id
my_dbt_project.not_null_combined_model_name
my_dbt_project.unique_model_a_id
my_dbt_project.unique_model_b_id

Behavior in Cosmos

The DAG example_multiple_parents_test uses this new dbt project:

import os
from datetime import datetime
from pathlib import Path

from cosmos import DbtDag, ProfileConfig, ProjectConfig
from cosmos.profiles import PostgresUserPasswordProfileMapping

DEFAULT_DBT_ROOT_PATH = Path(__file__).parent / "dbt"
DBT_ROOT_PATH = Path(os.getenv("DBT_ROOT_PATH", DEFAULT_DBT_ROOT_PATH))

profile_config = ProfileConfig(
    profile_name="default",
    target_name="dev",
    profile_mapping=PostgresUserPasswordProfileMapping(
        conn_id="example_conn",
        profile_args={"schema": "public"},
        disable_event_tracking=True,
    ),
)

example_multiple_parents_test = DbtDag(
    # dbt/cosmos-specific parameters
    project_config=ProjectConfig(
        DBT_ROOT_PATH / "multiple_parents_test",
    ),
    profile_config=profile_config,
    # normal dag parameters
    start_date=datetime(2023, 1, 1),
    dag_id="example_multiple_parents_test",
)

When trying to run it using:

airflow dags test example_multiple_parents_test

Users face the original error because the test is being attempted to be run after model_a was run but before combined_model is run:

Screenshot 2024-12-27 at 12 10 36

Excerpt from the logs of the failing task:

[2024-12-27T12:07:33.564+0000] {taskinstance.py:2905} ERROR - Task failed with exception
Traceback (most recent call last):
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/venvpy39/lib/python3.9/site-packages/airflow/models/taskinstance.py", line 465, in _execute_task
    result = _execute_callable(context=context, **execute_callable_kwargs)
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/venvpy39/lib/python3.9/site-packages/airflow/models/taskinstance.py", line 432, in _execute_callable
    return execute_callable(context=context, **execute_callable_kwargs)
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/venvpy39/lib/python3.9/site-packages/airflow/models/baseoperator.py", line 401, in wrapper
    return func(self, *args, **kwargs)
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/operators/local.py", line 796, in execute
    result = self.build_and_run_cmd(context=context, cmd_flags=self.add_cmd_flags())
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/operators/local.py", line 654, in build_and_run_cmd
    result = self.run_command(cmd=dbt_cmd, env=env, context=context)
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/operators/local.py", line 509, in run_command
    self.handle_exception(result)
  File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/operators/local.py", line 237, in handle_exception_dbt_runner
    raise AirflowException(f"dbt invocation completed with errors: {error_message}")
airflow.exceptions.AirflowException: dbt invocation completed with errors: custom_test_combined_model_combined_model_: Database Error in test custom_test_combined_model_combined_model_ (models/schema.yml)
  relation "public.combined_model" does not exist
  LINE 12:     SELECT id FROM "postgres"."public"."combined_model"
                              ^
  compiled Code at target/run/my_dbt_project/models/schema.yml/custom_test_combined_model_combined_model_.sql

Behaviour after this change

With this change, when running the DAG mentioned above, it results in:
Screenshot 2024-12-27 at 15 44 17

And it can successfully be run.

Breaking Change?

This PR slightly changes the behaviour of Cosmos DAG rendering when using TestBeahavior.AFTER_EACH or TestBeahavior.BUILD when there are tests with multiple parents. Some may consider it a breaking change, but a bug fix is a better classification since Cosmos did not support rendering many dbt projects that met these circumstances.

The behaviour change in those cases is that we're isolating tests that depend on multiple parents and running them outside of the TestBehaviour.AFTER_EACH dbt node Cosmos TaskGroup or TestBehaviour.BUILD.

This change will likely highlight any tests that depended on multiple models and were not failing previously but running as part of the tests of both models.

Copy link

netlify bot commented Dec 27, 2024

Deploy Preview for sunny-pastelito-5ecb04 canceled.

Name Link
🔨 Latest commit 5d0cece
🔍 Latest deploy log https://app.netlify.com/sites/sunny-pastelito-5ecb04/deploys/676ee25ce41dee0007cd7eac

Copy link

cloudflare-workers-and-pages bot commented Dec 27, 2024

Deploying astronomer-cosmos with  Cloudflare Pages  Cloudflare Pages

Latest commit: 5d0cece
Status: ✅  Deploy successful!
Preview URL: https://274792e1.astronomer-cosmos.pages.dev
Branch Preview URL: https://tests-multiple-parents.astronomer-cosmos.pages.dev

View logs

@tatiana tatiana changed the title WIP: Fix rendering DAGs with tests with multiple parents (AFTER_EACH & BUILD) WIP: Fix rendering DbtDags with tests with multiple parents (AFTER_EACH & BUILD) Dec 27, 2024
@tatiana tatiana changed the title WIP: Fix rendering DbtDags with tests with multiple parents (AFTER_EACH & BUILD) WIP: Fix rendering tests with multiple parents (AFTER_EACH & BUILD) Dec 27, 2024
@tatiana tatiana changed the title WIP: Fix rendering tests with multiple parents (AFTER_EACH & BUILD) WIP: Fix rendering dbt tests with multiple parents (AFTER_EACH & BUILD) Dec 27, 2024
@tatiana tatiana changed the title WIP: Fix rendering dbt tests with multiple parents (AFTER_EACH & BUILD) WIP: Fix rendering dbt tests with multiple parents Dec 27, 2024
@tatiana tatiana changed the title WIP: Fix rendering dbt tests with multiple parents Fix rendering dbt tests with multiple parents Dec 27, 2024
@tatiana tatiana marked this pull request as ready for review December 27, 2024 16:56
@dosubot dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Dec 27, 2024
@tatiana tatiana added this to the Cosmos 1.8.1 milestone Dec 27, 2024
@dosubot dosubot bot added area:rendering Related to rendering, like Jinja, Airflow tasks, etc dbt:test Primarily related to dbt test command or functionality labels Dec 27, 2024
Copy link

codecov bot commented Dec 27, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.94%. Comparing base (050ecd4) to head (5d0cece).
Report is 1 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1433      +/-   ##
==========================================
+ Coverage   96.92%   96.94%   +0.02%     
==========================================
  Files          73       73              
  Lines        4320     4350      +30     
==========================================
+ Hits         4187     4217      +30     
  Misses        133      133              

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Copy link
Contributor

@pankajkoti pankajkoti left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Nice support added.

cosmos/airflow/graph.py Show resolved Hide resolved
@pankajastro pankajastro mentioned this pull request Dec 30, 2024
@tatiana tatiana merged commit 7dc9411 into main Dec 30, 2024
69 checks passed
@tatiana tatiana deleted the tests-multiple-parents branch December 30, 2024 13:59
pankajastro added a commit that referenced this pull request Dec 30, 2024
1.8.1 (2024-12-30)
--------------------

Bug Fixes

* Fix rendering dbt tests with multiple parents by @tatiana in #1433
* Add ``kwargs`` param in DocsOperator method
``upload_to_cloud_storage`` by @pankajastro in #1422

Docs

* Improve OpenLineage documentation by @tatiana in #1431

Others

* Enable Docs DAG in CI leveraging existing CI connections by
@pankajkoti in #1428
* Install providers with airflow by @pankajkoti in #1432
* Remove unused docs dependency by @pankajastro in #1414
* Pre-commit hook updates in #1424 

---------

Co-authored-by: Tatiana Al-Chueyr <tatiana.alchueyr@gmail.com>
Co-authored-by: Pankaj Koti <pankajkoti699@gmail.com>
tatiana added a commit that referenced this pull request Jan 14, 2025
As part of #1433 (Cosmos 1.8.1), we introduced a bug: detached test nodes were being incorrectly displayed when using:
- `TestBehavior.NONE` (when no tests should be run)
- `TestBehavior.AFTER_ALL` (when all tests should be run by the end of the DAG, regardless of their type)

This PR solves this issue.

Closes: #1444
tatiana added a commit that referenced this pull request Jan 14, 2025
As part of #1433 (Cosmos 1.8.1), we introduced a bug: detached test nodes were being incorrectly displayed when using:
- `TestBehavior.NONE` (when no tests should be run)
- `TestBehavior.AFTER_ALL` (when all tests should be run by the end of the DAG, regardless of their type)

This PR solves this issue.

Closes: #1444
tatiana added a commit that referenced this pull request Jan 14, 2025
Since we introduced detached test tasks in #1433 (released in 1.8.0) users started facing issues due to very long task names exceeding Airflow's limits.

Example of stacktrace reported by user:
```
 Traceback (most recent call last):
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 968, in __init__
    validate_key(task_id)
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/utils/helpers.py", line 55, in validate_key
    raise AirflowException(f"The key has to be less than {max_length} characters")
airflow.exceptions.AirflowException: The key has to be less than 250 characters
```

This PR fixes this issue. In case the name exceeds Airflow's limit (250 ATM), it will name the detached test using:
- "detached_{incremental unique number}_test"

Closes: #1440
tatiana added a commit that referenced this pull request Jan 15, 2025
…1463)

As part of #1433 (Cosmos 1.8.1), we introduced a bug: detached test
nodes were being incorrectly displayed when using:
  - `TestBehavior.NONE` (when no tests should be run)
- `TestBehavior.AFTER_ALL` (when all tests should be run by the end of
the DAG, regardless of their type)
   
 This PR solves this issue.
    
 Closes: #1444
tatiana added a commit that referenced this pull request Jan 15, 2025
Since we introduced detached test tasks in #1433 (released in 1.8.0) users started facing issues due to very long task names exceeding Airflow's limits.

Example of stacktrace reported by user:
```
 Traceback (most recent call last):
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 968, in __init__
    validate_key(task_id)
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/utils/helpers.py", line 55, in validate_key
    raise AirflowException(f"The key has to be less than {max_length} characters")
airflow.exceptions.AirflowException: The key has to be less than 250 characters
```

This PR fixes this issue. In case the name exceeds Airflow's limit (250 ATM), it will name the detached test using:
- "detached_{incremental unique number}_test"

Closes: #1440
tatiana added a commit that referenced this pull request Jan 15, 2025
Since we introduced detached test tasks in #1433 (released in 1.8.0) users started facing issues due to very long task names exceeding Airflow's limits.

Example of stacktrace reported by user:
```
 Traceback (most recent call last):
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 968, in __init__
    validate_key(task_id)
  File "/home/airflow/.local/lib/python3.12/site-packages/airflow/utils/helpers.py", line 55, in validate_key
    raise AirflowException(f"The key has to be less than {max_length} characters")
airflow.exceptions.AirflowException: The key has to be less than 250 characters
```

This PR fixes this issue. In case the name exceeds Airflow's limit (250 ATM), it will name the detached test using:
- "detached_{incremental unique number}_test"

Closes: #1440
tatiana added a commit that referenced this pull request Jan 15, 2025
Since we introduced detached test tasks in #1433 (released in 1.8.0),
users started facing issues due to very long task names exceeding
Airflow's limits.
    
Example of Python traceback reported by user:
```
     Traceback (most recent call last):
      File "/home/airflow/.local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 968, in __init__
        validate_key(task_id)
      File "/home/airflow/.local/lib/python3.12/site-packages/airflow/utils/helpers.py", line 55, in validate_key
        raise AirflowException(f"The key has to be less than {max_length} characters")
    airflow.exceptions.AirflowException: The key has to be less than 250 characters
```
    
This PR fixes this issue. In case the name exceeds Airflow's limit (250
ATM), it will name the detached test using:
- "detached_{incremental unique number}_test"

We also considered naming the new test using:
- "parent1_parent2_..._test" - but that may not solve the issue,
especially in circumstances where the same parents may have multiple
detached nodes. Perhaps in future, we could bundle them in a single
task.

Closes: #1440
tatiana added a commit that referenced this pull request Jan 15, 2025
Since we introduced detached test tasks to fix a customer issue in #1433
(release 1.8.1), we changed how Cosmos renders DAGs, with the chance
that Cosmos significantly changed how it renders a dbt project - even
when users did not change their `DbtDag` or `DbtTaskGroup`
configuration. This is unacceptable in a micro release - and for this
reason we're reverting this change and making the feature opt-in.

PR #1433 led to issues such as #1464, reported by multiple Cosmos users,
and also issues that did not become Github issues, such as an Astro
customer who reported that when they upgraded to Cosmos 1.8.1, the
number of tasks increased dramatically. One problem with #1433 was that
it did not empower users to opt in or out of having detached test nodes,
solving the problem for some but causing problems for many.

This PR aims to solve this problem by introducing a new property to
`RenderConfig`: `should_detach_multiple_parents_tests`. We are reverting
the Cosmos DAG rendering to what it was in 1.8.0 and before: by default,
it will not detach tests with multiple parents. Users must opt-in for
this behaviour if and when they want to.

We understand this may be perceived as a breaking change by some, but it
is the correct way to move forward and avoid causing further disruption.

We are planning to review the current implementation, as described in
#1469. For now, this PR documents the current behaviour and
empowers users.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:rendering Related to rendering, like Jinja, Airflow tasks, etc dbt:test Primarily related to dbt test command or functionality size:L This PR changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Bug] Support associating tests to multiple parents, if they have multiple parents
3 participants